Tools for graph mining

نویسنده

  • Yiping Zhan
چکیده

Large real-world graphs often show interesting properties, such as power-law degree distributions and very small diameters. Discovering such patterns and regularities has a wide range of potential applications. It could help us with detecting outliers or abnormal subnetworks (such as terrorist networks or illegal money-laundering rings), maximizing e ciency of disease controlling, marketing, forecasting and simulations, to name a few. A graph generating model, the recursive matrix (R-MAT) model is introduced and a method (AutoMAT) is shown for automatically estimating the input parameters for R-MAT in order for it to match a given realworld graph. Using these parameters, the resulting R-MAT graphs are shown to match many properties of real graphs. Also, a set of plots (A-plots) are introduced as original ways for viewing large graphs. Their applications in nding interesting patterns and outliers in real, large graphs are demonstrated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining

Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...

متن کامل

A Systematic Mapping Study on the Usage of Software Tools for Graphs within the EDM Community

The field of educational data mining (EDM) has been slowly expanding to embrace various graph-based approaches to interpretation and analysis of educational data. However, there is a great wealth of software tools for graph creation, visualization, and analysis, both general-purpose and domain-specific, which may discourage EDM practitioners from finding a tool suitable for their graph-related ...

متن کامل

Structural graph-based representations used for finding hidden patterns

In graph-based data mining (GBDM) tasks, an accurate data representation is fundamental for finding hidden patterns. However, there does not exist a standard representation to describe structural data because of the specific domain characteristics. Then, different graph topologies could be used as data representation, which is a challenge for GBDM tools. In this paper we explore a methodology f...

متن کامل

Graph-based data mining: A new tool for the analysis and comparison of scientific domains represented as scientograms

The creation of some kind of representations depicting the current state of Science (or scientograms) is anestablishedandbeaten track formanyyearsnow.However, ifweare concernedwith the automatic comparison, analysis andunderstanding of a set of scientograms, showing for instance the evolution of a scientific domain or a face-to-face comparison of several countries, the task is titanically compl...

متن کامل

Analysis of the Time Evolution of Scientograms Using the Subdue Graph Mining Algorithm

Scientograms are a kind of graph representations depicting the state of Science in a specific domain. The automatic comparison and analysis of a set of scientograms, to show for instance the evolution of a scientific domain of a given country, is an interesting but challenging task as the handled data is huge and complex. In this paper, we aim to show that graph mining tools are useful to deal ...

متن کامل

On the Integration of Graph Exploration and Data Analysis: The Creative Exploration Toolkit

To enable discovery in large, heterogenious information networks a tool is needed that allows exploration in changing graph structures and integrates advanced graph mining methods in an interactive visualization framework. We present the Creative Exploration Toolkit (CET), which consists of a state-of-the-art user interface for graph visualization designed towards explorative tasks and support ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004